Dataset info
| Number of variables | 29 |
|---|---|
| Number of observations | 21201 |
| Missing cells | 25015 (4.1%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 4.7 MiB |
| Average record size in memory | 232.0 B |
Variables types
| Numeric | 10 |
|---|---|
| Categorical | 9 |
| Boolean | 0 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 9 |
| Unsupported | 0 |
Warnings
Arrival_at_Destination_-_Time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Arrival_at_Destination_-_Time has a high cardinality: 15725 distinct values | Warning |
Arrival_at_Pickup_-_Day_of_Month is highly correlated with Arrival_at_Destination_-_Day_of_Month (ρ = 1) | Rejected |
Arrival_at_Pickup_-_Time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Arrival_at_Pickup_-_Time has a high cardinality: 15767 distinct values | Warning |
Arrival_at_Pickup_-_Weekday_(Mo_=_1) is highly correlated with Arrival_at_Destination_-_Weekday_(Mo_=_1) (ρ = 1) | Rejected |
Confirmation_-_Day_of_Month is highly correlated with Arrival_at_Pickup_-_Day_of_Month (ρ = 1) | Rejected |
Confirmation_-_Time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Confirmation_-_Time has a high cardinality: 15742 distinct values | Warning |
Confirmation_-_Weekday_(Mo_=_1) is highly correlated with Arrival_at_Pickup_-_Weekday_(Mo_=_1) (ρ = 1) | Rejected |
Pickup_-_Day_of_Month is highly correlated with Confirmation_-_Day_of_Month (ρ = 1) | Rejected |
Pickup_-_Time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Pickup_-_Time has a high cardinality: 15690 distinct values | Warning |
Pickup_-_Weekday_(Mo_=_1) is highly correlated with Confirmation_-_Weekday_(Mo_=_1) (ρ = 1) | Rejected |
Placement_-_Day_of_Month is highly correlated with Pickup_-_Day_of_Month (ρ = 0.999998477) | Rejected |
Placement_-_Time only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
Placement_-_Time has a high cardinality: 15686 distinct values | Warning |
Placement_-_Weekday_(Mo_=_1) is highly correlated with Pickup_-_Weekday_(Mo_=_1) (ρ = 0.9999519962) | Rejected |
Precipitation_in_millimeters has 20649 (97.4%) missing values | Missing |
Rider_Id has a high cardinality: 924 distinct values | Warning |
Temperature has 4366 (20.6%) missing values | Missing |
User_Id has a high cardinality: 3186 distinct values | Warning |
Vehicle_Type has constant value "Bike" | Rejected |
Arrival_at_Destination_-_Day_of_Month
Numeric
| Distinct count | 31 |
|---|---|
| Unique (%) | 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 15.65383708 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| Median | 15 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range | 15 |
Descriptive statistics
| Standard deviation | 8.798886375 |
|---|---|
| Coef of variation | 0.5620913472 |
| Kurtosis | -1.2074902 |
| Mean | 15.65383708 |
| MAD | 7.633175565 |
| Skewness | 0.09103408839 |
| Sum | 331877 |
| Variance | 77.42040143 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 8 | 848 | 4.0% | |
| 7 | 822 | 3.9% | |
| 13 | 812 | 3.8% | |
| 14 | 804 | 3.8% | |
| 6 | 794 | 3.7% | |
| 28 | 784 | 3.7% | |
| 18 | 770 | 3.6% | |
| 4 | 769 | 3.6% | |
| 15 | 762 | 3.6% | |
| 11 | 751 | 3.5% | |
| Other values (21) | 13285 | 62.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 482 | 2.3% | |
| 2 | 602 | 2.8% | |
| 3 | 718 | 3.4% | |
| 4 | 769 | 3.6% | |
| 5 | 747 | 3.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 31 | 453 | 2.1% | |
| 30 | 714 | 3.4% | |
| 29 | 685 | 3.2% | |
| 28 | 784 | 3.7% | |
| 27 | 670 | 3.2% |
Arrival_at_Destination_-_Time
Categorical
| Distinct count | 15725 |
|---|---|
| Unique (%) | 74.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 3:24:03 PM | 7 |
|---|---|
| 1:08:03 PM | 6 |
| 11:16:33 AM | 6 |
| Other values (15722) |
| Value | Count | Frequency (%) | |
| 3:24:03 PM | 7 | < 0.1% | |
| 1:08:03 PM | 6 | < 0.1% | |
| 11:16:33 AM | 6 | < 0.1% | |
| 1:25:43 PM | 6 | < 0.1% | |
| 11:22:45 AM | 5 | < 0.1% | |
| 3:08:49 PM | 5 | < 0.1% | |
| 2:42:57 PM | 5 | < 0.1% | |
| 11:35:08 AM | 5 | < 0.1% | |
| 1:05:44 PM | 5 | < 0.1% | |
| 4:07:22 PM | 5 | < 0.1% | |
| Other values (15715) | 21146 | 99.7% |
| Max length | 11 |
|---|---|
| Mean length | 10.35017216 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Arrival_at_Destination_-_Weekday_(Mo_=_1)
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.240224518 |
|---|---|
| Minimum | 1 |
| Maximum | 7 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| Median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 1.567228303 |
|---|---|
| Coef of variation | 0.4836789223 |
| Kurtosis | -1.024853597 |
| Mean | 3.240224518 |
| MAD | 1.350351179 |
| Skewness | 0.1104716065 |
| Sum | 68696 |
| Variance | 2.456204553 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 4 | 4229 | 19.9% | |
| 5 | 3993 | 18.8% | |
| 2 | 3959 | 18.7% | |
| 3 | 3823 | 18.0% | |
| 1 | 3788 | 17.9% | |
| 6 | 1223 | 5.8% | |
| 7 | 186 | 0.9% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 3788 | 17.9% | |
| 2 | 3959 | 18.7% | |
| 3 | 3823 | 18.0% | |
| 4 | 4229 | 19.9% | |
| 5 | 3993 | 18.8% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 7 | 186 | 0.9% | |
| 6 | 1223 | 5.8% | |
| 5 | 3993 | 18.8% | |
| 4 | 4229 | 19.9% | |
| 3 | 3823 | 18.0% |
Arrival_at_Pickup_-_Day_of_Month
Highly correlated
This variable is highly correlated with Arrival_at_Destination_-_Day_of_Month and should be ignored for analysis
| Correlation | 1 |
|---|
Arrival_at_Pickup_-_Time
Categorical
| Distinct count | 15767 |
|---|---|
| Unique (%) | 74.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 3:02:41 PM | 6 |
|---|---|
| 2:42:41 PM | 6 |
| 2:10:11 PM | 6 |
| Other values (15764) |
| Value | Count | Frequency (%) | |
| 3:02:41 PM | 6 | < 0.1% | |
| 2:42:41 PM | 6 | < 0.1% | |
| 2:10:11 PM | 6 | < 0.1% | |
| 1:02:53 PM | 6 | < 0.1% | |
| 2:32:04 PM | 6 | < 0.1% | |
| 2:52:44 PM | 5 | < 0.1% | |
| 3:33:04 PM | 5 | < 0.1% | |
| 3:05:00 PM | 5 | < 0.1% | |
| 10:22:40 AM | 5 | < 0.1% | |
| 9:56:09 AM | 5 | < 0.1% | |
| Other values (15757) | 21146 | 99.7% |
| Max length | 11 |
|---|---|
| Mean length | 10.37234093 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Arrival_at_Pickup_-_Weekday_(Mo_=_1)
Highly correlated
This variable is highly correlated with Arrival_at_Destination_-_Weekday_(Mo_=_1) and should be ignored for analysis
| Correlation | 1 |
|---|
Confirmation_-_Day_of_Month
Highly correlated
This variable is highly correlated with Arrival_at_Pickup_-_Day_of_Month and should be ignored for analysis
| Correlation | 1 |
|---|
Confirmation_-_Time
Categorical
| Distinct count | 15742 |
|---|---|
| Unique (%) | 74.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 9:56:52 AM | 6 |
|---|---|
| 3:47:50 PM | 5 |
| 3:45:01 PM | 5 |
| Other values (15739) |
| Value | Count | Frequency (%) | |
| 9:56:52 AM | 6 | < 0.1% | |
| 3:47:50 PM | 5 | < 0.1% | |
| 3:45:01 PM | 5 | < 0.1% | |
| 11:21:38 AM | 5 | < 0.1% | |
| 3:20:53 PM | 5 | < 0.1% | |
| 10:18:08 AM | 5 | < 0.1% | |
| 2:28:37 PM | 5 | < 0.1% | |
| 11:07:48 AM | 5 | < 0.1% | |
| 12:02:59 PM | 5 | < 0.1% | |
| 3:48:00 PM | 5 | < 0.1% | |
| Other values (15732) | 21150 | 99.8% |
| Max length | 11 |
|---|---|
| Mean length | 10.37606717 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Confirmation_-_Weekday_(Mo_=_1)
Highly correlated
This variable is highly correlated with Arrival_at_Pickup_-_Weekday_(Mo_=_1) and should be ignored for analysis
| Correlation | 1 |
|---|
Destination_Lat
Numeric
| Distinct count | 5302 |
|---|---|
| Unique (%) | 25.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -1.282580838 |
|---|---|
| Minimum | -1.4302983 |
| Maximum | -1.0302254 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | -1.4302983 |
|---|---|
| 5-th percentile | -1.3366311 |
| Q1 | -1.3012008 |
| Median | -1.284382 |
| Q3 | -1.261177 |
| 95-th percentile | -1.225322 |
| Maximum | -1.0302254 |
| Range | 0.4000729 |
| Interquartile range | 0.0400238 |
Descriptive statistics
| Standard deviation | 0.03482356811 |
|---|---|
| Coef of variation | -0.02715116824 |
| Kurtosis | 2.266756264 |
| Mean | -1.282580838 |
| MAD | 0.02642191732 |
| Skewness | 0.1723612574 |
| Sum | -27191.99635 |
| Variance | 0.001212680896 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| -1.3004062 | 579 | 2.7% | |
| -1.2551895 | 529 | 2.5% | |
| -1.2638185 | 297 | 1.4% | |
| -1.265715 | 274 | 1.3% | |
| -1.2600926 | 269 | 1.3% | |
| -1.2571472 | 263 | 1.2% | |
| -1.2628473 | 263 | 1.2% | |
| -1.306378 | 231 | 1.1% | |
| -1.2991441 | 214 | 1.0% | |
| -1.2584143 | 204 | 1.0% | |
| Other values (5292) | 18078 | 85.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -1.4302983 | 1 | < 0.1% | |
| -1.4287625 | 1 | < 0.1% | |
| -1.4242042 | 2 | < 0.1% | |
| -1.4238401 | 1 | < 0.1% | |
| -1.4196538 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| -1.0302254 | 1 | < 0.1% | |
| -1.0352624 | 1 | < 0.1% | |
| -1.0546336 | 2 | < 0.1% | |
| -1.0568484 | 1 | < 0.1% | |
| -1.0921599 | 5 | < 0.1% |
Destination_Long
Numeric
| Distinct count | 5267 |
|---|---|
| Unique (%) | 24.8% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 36.81122 |
|---|---|
| Minimum | 36.6065939 |
| Maximum | 37.0167793 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 36.6065939 |
|---|---|
| 5-th percentile | 36.7519845 |
| Q1 | 36.7856612 |
| Median | 36.8080021 |
| Q3 | 36.829477 |
| 95-th percentile | 36.897607 |
| Maximum | 37.0167793 |
| Range | 0.4101854 |
| Interquartile range | 0.0438158 |
Descriptive statistics
| Standard deviation | 0.04472064173 |
|---|---|
| Coef of variation | 0.001214864428 |
| Kurtosis | 1.340514535 |
| Mean | 36.81122 |
| MAD | 0.03251410604 |
| Skewness | 0.4078052915 |
| Sum | 780434.6751 |
| Variance | 0.001999935797 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 36.829741 | 579 | 2.7% | |
| 36.7822034 | 529 | 2.5% | |
| 36.7930057 | 296 | 1.4% | |
| 36.823815 | 269 | 1.3% | |
| 36.8088685 | 269 | 1.3% | |
| 36.781805 | 263 | 1.2% | |
| 36.7950633 | 263 | 1.2% | |
| 36.7519845 | 231 | 1.1% | |
| 36.7528804 | 214 | 1.0% | |
| 36.8048002 | 204 | 1.0% | |
| Other values (5257) | 18084 | 85.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 36.6065939 | 1 | < 0.1% | |
| 36.6308896 | 1 | < 0.1% | |
| 36.6402664 | 1 | < 0.1% | |
| 36.6405079 | 1 | < 0.1% | |
| 36.6426562 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 37.0167793 | 1 | < 0.1% | |
| 37.0120302 | 2 | < 0.1% | |
| 37.0109232 | 1 | < 0.1% | |
| 37.0108117 | 1 | < 0.1% | |
| 37.0056922 | 1 | < 0.1% |
Distance_(KM)
Numeric
| Distinct count | 45 |
|---|---|
| Unique (%) | 0.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.506532711 |
|---|---|
| Minimum | 1 |
| Maximum | 49 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| Median | 8 |
| Q3 | 13 |
| 95-th percentile | 20 |
| Maximum | 49 |
| Range | 48 |
| Interquartile range | 8 |
Descriptive statistics
| Standard deviation | 5.668962773 |
|---|---|
| Coef of variation | 0.5963228599 |
| Kurtosis | 1.458819669 |
| Mean | 9.506532711 |
| MAD | 4.480305835 |
| Skewness | 1.085025791 |
| Sum | 201548 |
| Variance | 32.13713893 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 8 | 1973 | 9.3% | |
| 5 | 1929 | 9.1% | |
| 6 | 1726 | 8.1% | |
| 4 | 1675 | 7.9% | |
| 7 | 1672 | 7.9% | |
| 9 | 1463 | 6.9% | |
| 10 | 1162 | 5.5% | |
| 3 | 1111 | 5.2% | |
| 11 | 1030 | 4.9% | |
| 14 | 950 | 4.5% | |
| Other values (35) | 6510 | 30.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 307 | 1.4% | |
| 2 | 770 | 3.6% | |
| 3 | 1111 | 5.2% | |
| 4 | 1675 | 7.9% | |
| 5 | 1929 | 9.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 49 | 1 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 46 | 1 | < 0.1% | |
| 44 | 1 | < 0.1% | |
| 41 | 2 | < 0.1% |
Order_No
Categorical, Unique
| First 5 values |
|---|
| Order_No_1 |
| Order_No_10 |
| Order_No_100 |
| Order_No_1000 |
| Order_No_10000 |
| Last 5 values |
|---|
| Order_No_9992 |
| Order_No_9993 |
| Order_No_9995 |
| Order_No_9996 |
| Order_No_9997 |
First 5 values
| Value | Count | Frequency (%) | |
| Order_No_1 | 1 | < 0.1% | |
| Order_No_10 | 1 | < 0.1% | |
| Order_No_100 | 1 | < 0.1% | |
| Order_No_1000 | 1 | < 0.1% | |
| Order_No_10000 | 1 | < 0.1% |
Last 5 values
| Value | Count | Frequency (%) | |
| Order_No_9997 | 1 | < 0.1% | |
| Order_No_9996 | 1 | < 0.1% | |
| Order_No_9995 | 1 | < 0.1% | |
| Order_No_9993 | 1 | < 0.1% | |
| Order_No_9992 | 1 | < 0.1% |
Personal_or_Business
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Business | |
|---|---|
| Personal |
| Value | Count | Frequency (%) | |
| Business | 17384 | 82.0% | |
| Personal | 3817 | 18.0% |
| Max length | 8 |
|---|---|
| Mean length | 8 |
| Min length | 8 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
Pickup_-_Day_of_Month
Highly correlated
This variable is highly correlated with Confirmation_-_Day_of_Month and should be ignored for analysis
| Correlation | 1 |
|---|
Pickup_-_Time
Categorical
| Distinct count | 15690 |
|---|---|
| Unique (%) | 74.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 12:04:20 PM | 6 |
|---|---|
| 10:53:20 AM | 6 |
| 2:28:10 PM | 6 |
| Other values (15687) |
| Value | Count | Frequency (%) | |
| 12:04:20 PM | 6 | < 0.1% | |
| 10:53:20 AM | 6 | < 0.1% | |
| 2:28:10 PM | 6 | < 0.1% | |
| 11:19:28 AM | 6 | < 0.1% | |
| 3:14:23 PM | 5 | < 0.1% | |
| 4:15:10 PM | 5 | < 0.1% | |
| 3:01:12 PM | 5 | < 0.1% | |
| 1:10:24 PM | 5 | < 0.1% | |
| 3:22:34 PM | 5 | < 0.1% | |
| 12:10:43 PM | 5 | < 0.1% | |
| Other values (15680) | 21147 | 99.7% |
| Max length | 11 |
|---|---|
| Mean length | 10.37069006 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Pickup_-_Weekday_(Mo_=_1)
Highly correlated
This variable is highly correlated with Confirmation_-_Weekday_(Mo_=_1) and should be ignored for analysis
| Correlation | 1 |
|---|
Pickup_Lat
Numeric
| Distinct count | 3666 |
|---|---|
| Unique (%) | 17.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -1.281469693 |
|---|---|
| Minimum | -1.4383017 |
| Maximum | -1.1471704 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | -1.4383017 |
|---|---|
| 5-th percentile | -1.3302996 |
| Q1 | -1.300921 |
| Median | -1.279395 |
| Q3 | -1.2571472 |
| 95-th percentile | -1.2297202 |
| Maximum | -1.1471704 |
| Range | 0.2911313 |
| Interquartile range | 0.0437738 |
Descriptive statistics
| Standard deviation | 0.03050707731 |
|---|---|
| Coef of variation | -0.02380631979 |
| Kurtosis | 0.8191796651 |
| Mean | -1.281469693 |
| MAD | 0.0246201832 |
| Skewness | -0.2291906139 |
| Sum | -27168.43896 |
| Variance | 0.0009306817659 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| -1.2551895 | 2429 | 11.5% | |
| -1.2571472 | 629 | 3.0% | |
| -1.3167113 | 597 | 2.8% | |
| -1.300921 | 528 | 2.5% | |
| -1.3004062 | 510 | 2.4% | |
| -1.2584143 | 412 | 1.9% | |
| -1.272639 | 291 | 1.4% | |
| -1.290894 | 259 | 1.2% | |
| -1.279395 | 217 | 1.0% | |
| -1.273056 | 207 | 1.0% | |
| Other values (3656) | 15122 | 71.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -1.4383017 | 1 | < 0.1% | |
| -1.4332561 | 1 | < 0.1% | |
| -1.428932 | 1 | < 0.1% | |
| -1.4242042 | 1 | < 0.1% | |
| -1.4226525 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| -1.1471704 | 1 | < 0.1% | |
| -1.1490105 | 1 | < 0.1% | |
| -1.1518113 | 1 | < 0.1% | |
| -1.1538233 | 1 | < 0.1% | |
| -1.15393 | 1 | < 0.1% |
Pickup_Long
Numeric
| Distinct count | 3656 |
|---|---|
| Unique (%) | 17.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 36.81126359 |
|---|---|
| Minimum | 36.653621 |
| Maximum | 36.9910462 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 36.653621 |
|---|---|
| 5-th percentile | 36.764868 |
| Q1 | 36.7846054 |
| Median | 36.8070399 |
| Q3 | 36.829741 |
| 95-th percentile | 36.8863135 |
| Maximum | 36.9910462 |
| Range | 0.3374252 |
| Interquartile range | 0.0451356 |
Descriptive statistics
| Standard deviation | 0.03747255524 |
|---|---|
| Coef of variation | 0.001017964383 |
| Kurtosis | 1.454746096 |
| Mean | 36.81126359 |
| MAD | 0.02816052045 |
| Skewness | 0.5462831696 |
| Sum | 780435.5994 |
| Variance | 0.001404192396 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 36.7822034 | 2429 | 11.5% | |
| 36.7950633 | 629 | 3.0% | |
| 36.8301563 | 597 | 2.8% | |
| 36.828195 | 528 | 2.5% | |
| 36.829741 | 510 | 2.4% | |
| 36.8048002 | 412 | 1.9% | |
| 36.794723 | 291 | 1.4% | |
| 36.822971 | 259 | 1.2% | |
| 36.825364 | 216 | 1.0% | |
| 36.811298 | 207 | 1.0% | |
| Other values (3646) | 15123 | 71.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 36.653621 | 1 | < 0.1% | |
| 36.6538278 | 1 | < 0.1% | |
| 36.6647806 | 1 | < 0.1% | |
| 36.6654506 | 1 | < 0.1% | |
| 36.6664253 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 36.9910462 | 1 | < 0.1% | |
| 36.9806911 | 1 | < 0.1% | |
| 36.9759716 | 1 | < 0.1% | |
| 36.967084 | 13 | 0.1% | |
| 36.9647412 | 1 | < 0.1% |
Placement_-_Day_of_Month
Highly correlated
This variable is highly correlated with Pickup_-_Day_of_Month and should be ignored for analysis
| Correlation | 0.999998477 |
|---|
Placement_-_Time
Categorical
| Distinct count | 15686 |
|---|---|
| Unique (%) | 74.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2:57:22 PM | 6 |
|---|---|
| 10:31:43 AM | 6 |
| 2:24:11 PM | 6 |
| Other values (15683) |
| Value | Count | Frequency (%) | |
| 2:57:22 PM | 6 | < 0.1% | |
| 10:31:43 AM | 6 | < 0.1% | |
| 2:24:11 PM | 6 | < 0.1% | |
| 9:41:03 AM | 6 | < 0.1% | |
| 12:51:03 PM | 6 | < 0.1% | |
| 2:06:19 PM | 6 | < 0.1% | |
| 11:45:52 AM | 5 | < 0.1% | |
| 2:26:53 PM | 5 | < 0.1% | |
| 2:39:54 PM | 5 | < 0.1% | |
| 2:30:43 PM | 5 | < 0.1% | |
| Other values (15676) | 21145 | 99.7% |
| Max length | 11 |
|---|---|
| Mean length | 10.37988774 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
Placement_-_Weekday_(Mo_=_1)
Highly correlated
This variable is highly correlated with Pickup_-_Weekday_(Mo_=_1) and should be ignored for analysis
| Correlation | 0.9999519962 |
|---|
Platform_Type
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 3 | |
|---|---|
| 1 | 2147 |
| 2 | 980 |
| Value | Count | Frequency (%) | |
| 3 | 18054 | 85.2% | |
| 1 | 2147 | 10.1% | |
| 2 | 980 | 4.6% | |
| 4 | 20 | 0.1% |
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
Precipitation_in_millimeters
Numeric
| Distinct count | 55 |
|---|---|
| Unique (%) | 0.3% |
| Missing (%) | 97.4% |
| Missing (n) | 20649 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 7.905797101 |
|---|---|
| Minimum | 0.1 |
| Maximum | 99.1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.3 |
| Q1 | 1.075 |
| Median | 2.9 |
| Q3 | 4.9 |
| 95-th percentile | 50.945 |
| Maximum | 99.1 |
| Range | 99 |
| Interquartile range | 3.825 |
Descriptive statistics
| Standard deviation | 17.08997124 |
|---|---|
| Coef of variation | 2.16170122 |
| Kurtosis | 15.73944046 |
| Mean | 7.905797101 |
| MAD | 9.058433102 |
| Skewness | 3.908902069 |
| Sum | 4364 |
| Variance | 292.067117 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 2 | 39 | 0.2% | |
| 1.9 | 37 | 0.2% | |
| 0.9 | 35 | 0.2% | |
| 1.1 | 34 | 0.2% | |
| 3 | 29 | 0.1% | |
| 3.1 | 27 | 0.1% | |
| 2.1 | 25 | 0.1% | |
| 2.9 | 22 | 0.1% | |
| 1 | 21 | 0.1% | |
| 3.9 | 19 | 0.1% | |
| Other values (44) | 264 | 1.2% | |
| (Missing) | 20649 | 97.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.1 | 6 | < 0.1% | |
| 0.2 | 9 | < 0.1% | |
| 0.3 | 18 | 0.1% | |
| 0.4 | 12 | 0.1% | |
| 0.5 | 8 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 99.1 | 3 | < 0.1% | |
| 99 | 1 | < 0.1% | |
| 98.9 | 5 | < 0.1% | |
| 83.1 | 2 | < 0.1% | |
| 83 | 2 | < 0.1% |
Rider_Id
Categorical
| Distinct count | 924 |
|---|---|
| Unique (%) | 4.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Rider_Id_726 | 247 |
|---|---|
| Rider_Id_576 | 223 |
| Rider_Id_523 | 199 |
| Other values (921) |
| Value | Count | Frequency (%) | |
| Rider_Id_726 | 247 | 1.2% | |
| Rider_Id_576 | 223 | 1.1% | |
| Rider_Id_523 | 199 | 0.9% | |
| Rider_Id_101 | 183 | 0.9% | |
| Rider_Id_205 | 182 | 0.9% | |
| Rider_Id_882 | 172 | 0.8% | |
| Rider_Id_427 | 168 | 0.8% | |
| Rider_Id_116 | 158 | 0.7% | |
| Rider_Id_88 | 152 | 0.7% | |
| Rider_Id_103 | 151 | 0.7% | |
| Other values (914) | 19366 | 91.3% |
| Max length | 12 |
|---|---|
| Mean length | 11.8805245 |
| Min length | 10 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
Temperature
Numeric
| Distinct count | 189 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 20.6% |
| Missing (n) | 4366 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 23.25888922 |
|---|---|
| Minimum | 11.2 |
| Maximum | 32.1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 11.2 |
|---|---|
| 5-th percentile | 17.3 |
| Q1 | 20.6 |
| Median | 23.5 |
| Q3 | 26 |
| 95-th percentile | 28.73 |
| Maximum | 32.1 |
| Range | 20.9 |
| Interquartile range | 5.4 |
Descriptive statistics
| Standard deviation | 3.615768283 |
|---|---|
| Coef of variation | 0.155457479 |
| Kurtosis | -0.5743438517 |
| Mean | 23.25888922 |
| MAD | 2.991490797 |
| Skewness | -0.1536427098 |
| Sum | 391563.4 |
| Variance | 13.07378028 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 24.7 | 201 | 0.9% | |
| 22.4 | 196 | 0.9% | |
| 23.8 | 195 | 0.9% | |
| 23.7 | 191 | 0.9% | |
| 24.6 | 189 | 0.9% | |
| 23.6 | 188 | 0.9% | |
| 25.2 | 184 | 0.9% | |
| 22.5 | 183 | 0.9% | |
| 27.3 | 179 | 0.8% | |
| 24.8 | 178 | 0.8% | |
| Other values (178) | 14951 | 70.5% | |
| (Missing) | 4366 | 20.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 11.2 | 1 | < 0.1% | |
| 13.2 | 5 | < 0.1% | |
| 13.3 | 6 | < 0.1% | |
| 13.4 | 5 | < 0.1% | |
| 13.5 | 4 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 32.1 | 4 | < 0.1% | |
| 32 | 2 | < 0.1% | |
| 31.9 | 2 | < 0.1% | |
| 31.8 | 6 | < 0.1% | |
| 31.7 | 3 | < 0.1% |
Time_from_Pickup_to_Arrival
Numeric
| Distinct count | 4067 |
|---|---|
| Unique (%) | 19.2% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1556.920947 |
|---|---|
| Minimum | 1 |
| Maximum | 7883 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 285 |
| Q1 | 882 |
| Median | 1369 |
| Q3 | 2040 |
| 95-th percentile | 3499 |
| Maximum | 7883 |
| Range | 7882 |
| Interquartile range | 1158 |
Descriptive statistics
| Standard deviation | 987.2707879 |
|---|---|
| Coef of variation | 0.6341174802 |
| Kurtosis | 2.236351987 |
| Mean | 1556.920947 |
| MAD | 749.1729935 |
| Skewness | 1.201937919 |
| Sum | 33008281 |
| Variance | 974703.6086 |
| Memory size | 165.8 KiB |
| Value | Count | Frequency (%) | |
| 2 | 143 | 0.7% | |
| 3 | 97 | 0.5% | |
| 4 | 66 | 0.3% | |
| 5 | 56 | 0.3% | |
| 7 | 37 | 0.2% | |
| 1 | 37 | 0.2% | |
| 6 | 36 | 0.2% | |
| 8 | 29 | 0.1% | |
| 10 | 25 | 0.1% | |
| 1424 | 24 | 0.1% | |
| Other values (4057) | 20651 | 97.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 37 | 0.2% | |
| 2 | 143 | 0.7% | |
| 3 | 97 | 0.5% | |
| 4 | 66 | 0.3% | |
| 5 | 56 | 0.3% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 7883 | 1 | < 0.1% | |
| 7714 | 1 | < 0.1% | |
| 7646 | 1 | < 0.1% | |
| 7491 | 1 | < 0.1% | |
| 7387 | 1 | < 0.1% |
User_Id
Categorical
| Distinct count | 3186 |
|---|---|
| Unique (%) | 15.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| User_Id_393 | 645 |
|---|---|
| User_Id_2330 | 322 |
| User_Id_3647 | 312 |
| Other values (3183) |
| Value | Count | Frequency (%) | |
| User_Id_393 | 645 | 3.0% | |
| User_Id_2330 | 322 | 1.5% | |
| User_Id_3647 | 312 | 1.5% | |
| User_Id_1500 | 301 | 1.4% | |
| User_Id_635 | 290 | 1.4% | |
| User_Id_868 | 278 | 1.3% | |
| User_Id_3291 | 276 | 1.3% | |
| User_Id_3283 | 268 | 1.3% | |
| User_Id_136 | 211 | 1.0% | |
| User_Id_1329 | 208 | 1.0% | |
| Other values (3176) | 18090 | 85.3% |
| Max length | 12 |
|---|---|
| Mean length | 11.67492099 |
| Min length | 9 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | False |
Vehicle_Type
Constant
This variable is constant and should be ignored for analysis
| Constant value | Bike |
|---|
First rows
| Arrival_at_Destination_-_Day_of_Month | Arrival_at_Destination_-_Time | Arrival_at_Destination_-_Weekday_(Mo_=_1) | Arrival_at_Pickup_-_Day_of_Month | Arrival_at_Pickup_-_Time | Arrival_at_Pickup_-_Weekday_(Mo_=_1) | Confirmation_-_Day_of_Month | Confirmation_-_Time | Confirmation_-_Weekday_(Mo_=_1) | Destination_Lat | Destination_Long | Distance_(KM) | Order_No | Personal_or_Business | Pickup_-_Day_of_Month | Pickup_-_Time | Pickup_-_Weekday_(Mo_=_1) | Pickup_Lat | Pickup_Long | Placement_-_Day_of_Month | Placement_-_Time | Placement_-_Weekday_(Mo_=_1) | Platform_Type | Precipitation_in_millimeters | Rider_Id | Temperature | Time_from_Pickup_to_Arrival | User_Id | Vehicle_Type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 9 | 10:39:55 AM | 5 | 9 | 10:04:47 AM | 5 | 9 | 9:40:10 AM | 5 | -1.300406 | 36.829741 | 4 | Order_No_4211 | Business | 9 | 10:27:30 AM | 5 | -1.317755 | 36.830370 | 9 | 9:35:46 AM | 5 | 3 | NaN | Rider_Id_432 | 20.4 | 745 | User_Id_633 | Bike |
| 1 | 12 | 12:17:22 PM | 5 | 12 | 11:40:22 AM | 5 | 12 | 11:23:21 AM | 5 | -1.295004 | 36.814358 | 16 | Order_No_25375 | Personal | 12 | 11:44:09 AM | 5 | -1.351453 | 36.899315 | 12 | 11:16:16 AM | 5 | 3 | NaN | Rider_Id_856 | 26.4 | 1993 | User_Id_2285 | Bike |
| 2 | 30 | 1:00:38 PM | 2 | 30 | 12:49:34 PM | 2 | 30 | 12:42:44 PM | 2 | -1.300921 | 36.828195 | 3 | Order_No_1899 | Business | 30 | 12:53:03 PM | 2 | -1.308284 | 36.843419 | 30 | 12:39:25 PM | 2 | 3 | NaN | Rider_Id_155 | NaN | 455 | User_Id_265 | Bike |
| 3 | 15 | 10:05:27 AM | 5 | 15 | 9:37:56 AM | 5 | 15 | 9:26:05 AM | 5 | -1.257147 | 36.795063 | 9 | Order_No_9336 | Business | 15 | 9:43:06 AM | 5 | -1.281301 | 36.832396 | 15 | 9:25:34 AM | 5 | 3 | NaN | Rider_Id_855 | 19.2 | 1341 | User_Id_1402 | Bike |
| 4 | 13 | 10:25:37 AM | 1 | 13 | 10:03:53 AM | 1 | 13 | 9:56:18 AM | 1 | -1.295041 | 36.809817 | 9 | Order_No_27883 | Personal | 13 | 10:05:23 AM | 1 | -1.266597 | 36.792118 | 13 | 9:55:18 AM | 1 | 1 | NaN | Rider_Id_770 | 15.4 | 1214 | User_Id_1737 | Bike |
| 5 | 14 | 4:23:41 PM | 5 | 14 | 3:21:36 PM | 5 | 14 | 3:08:57 PM | 5 | -1.257309 | 36.806008 | 9 | Order_No_7408 | Business | 14 | 3:30:30 PM | 5 | -1.302583 | 36.767081 | 14 | 3:07:35 PM | 5 | 3 | NaN | Rider_Id_124 | 27.2 | 3191 | User_Id_1342 | Bike |
| 6 | 9 | 10:19:45 AM | 5 | 9 | 9:53:12 AM | 5 | 9 | 9:49:47 AM | 5 | -1.276574 | 36.851365 | 5 | Order_No_22680 | Business | 9 | 9:56:45 AM | 5 | -1.279395 | 36.825364 | 9 | 9:33:45 AM | 5 | 3 | NaN | Rider_Id_114 | 20.3 | 1380 | User_Id_2803 | Bike |
| 7 | 11 | 2:33:26 PM | 1 | 11 | 2:21:33 PM | 1 | 11 | 2:14:13 PM | 1 | -1.255189 | 36.782203 | 3 | Order_No_21578 | Business | 11 | 2:22:40 PM | 1 | -1.252796 | 36.800313 | 11 | 2:13:01 PM | 1 | 3 | NaN | Rider_Id_913 | 28.7 | 646 | User_Id_1075 | Bike |
| 8 | 30 | 1:19:35 PM | 2 | 30 | 12:13:18 PM | 2 | 30 | 11:15:49 AM | 2 | -1.300255 | 36.825657 | 9 | Order_No_5234 | Business | 30 | 12:22:57 PM | 2 | -1.255189 | 36.782203 | 30 | 11:10:44 AM | 2 | 3 | NaN | Rider_Id_394 | NaN | 3398 | User_Id_733 | Bike |
| 9 | 23 | 6:31:57 PM | 5 | 23 | 5:32:41 PM | 5 | 23 | 5:17:56 PM | 5 | -1.215601 | 36.891686 | 14 | Order_No_1768 | Business | 23 | 5:34:38 PM | 5 | -1.225322 | 36.808550 | 23 | 4:48:54 PM | 5 | 3 | NaN | Rider_Id_660 | 24.6 | 3439 | User_Id_2112 | Bike |
Last rows
| Arrival_at_Destination_-_Day_of_Month | Arrival_at_Destination_-_Time | Arrival_at_Destination_-_Weekday_(Mo_=_1) | Arrival_at_Pickup_-_Day_of_Month | Arrival_at_Pickup_-_Time | Arrival_at_Pickup_-_Weekday_(Mo_=_1) | Confirmation_-_Day_of_Month | Confirmation_-_Time | Confirmation_-_Weekday_(Mo_=_1) | Destination_Lat | Destination_Long | Distance_(KM) | Order_No | Personal_or_Business | Pickup_-_Day_of_Month | Pickup_-_Time | Pickup_-_Weekday_(Mo_=_1) | Pickup_Lat | Pickup_Long | Placement_-_Day_of_Month | Placement_-_Time | Placement_-_Weekday_(Mo_=_1) | Platform_Type | Precipitation_in_millimeters | Rider_Id | Temperature | Time_from_Pickup_to_Arrival | User_Id | Vehicle_Type | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 21191 | 1 | 3:39:55 PM | 4 | 1 | 2:51:53 PM | 4 | 1 | 2:10:27 PM | 4 | -1.215601 | 36.891686 | 13 | Order_No_16262 | Business | 1 | 2:53:08 PM | 4 | -1.282142 | 36.816156 | 1 | 2:09:37 PM | 4 | 3 | NaN | Rider_Id_736 | NaN | 2807 | User_Id_1231 | Bike |
| 21192 | 23 | 1:21:22 PM | 2 | 23 | 12:44:19 PM | 2 | 23 | 12:41:52 PM | 2 | -1.308327 | 36.849476 | 7 | Order_No_11670 | Business | 23 | 1:05:11 PM | 2 | -1.316711 | 36.830156 | 23 | 12:41:17 PM | 2 | 3 | NaN | Rider_Id_571 | NaN | 971 | User_Id_1146 | Bike |
| 21193 | 12 | 5:56:53 PM | 6 | 12 | 5:06:48 PM | 6 | 12 | 4:49:51 PM | 6 | -1.300106 | 36.758455 | 10 | Order_No_4988 | Personal | 12 | 5:27:22 PM | 6 | -1.286568 | 36.825335 | 12 | 4:48:16 PM | 6 | 1 | NaN | Rider_Id_103 | 23.5 | 1771 | User_Id_875 | Bike |
| 21194 | 15 | 5:53:33 PM | 1 | 15 | 5:08:13 PM | 1 | 15 | 5:02:52 PM | 1 | -1.319862 | 36.711032 | 16 | Order_No_865 | Business | 15 | 5:15:15 PM | 1 | -1.260234 | 36.799055 | 15 | 5:02:09 PM | 1 | 3 | NaN | Rider_Id_177 | NaN | 2298 | User_Id_1245 | Bike |
| 21195 | 2 | 2:33:21 PM | 6 | 2 | 1:52:11 PM | 6 | 2 | 1:25:40 PM | 6 | -1.276549 | 36.766981 | 17 | Order_No_9932 | Business | 2 | 1:54:36 PM | 6 | -1.238406 | 36.871870 | 2 | 1:08:34 PM | 6 | 3 | NaN | Rider_Id_34 | 29.0 | 2325 | User_Id_3582 | Bike |
| 21196 | 20 | 4:20:17 PM | 3 | 20 | 3:58:49 PM | 3 | 20 | 3:55:09 PM | 3 | -1.275285 | 36.802702 | 3 | Order_No_8834 | Personal | 20 | 4:20:08 PM | 3 | -1.258414 | 36.804800 | 20 | 3:54:38 PM | 3 | 3 | NaN | Rider_Id_953 | 28.6 | 9 | User_Id_2001 | Bike |
| 21197 | 13 | 10:46:17 AM | 6 | 13 | 10:20:04 AM | 6 | 13 | 10:13:41 AM | 6 | -1.331619 | 36.847976 | 7 | Order_No_22892 | Business | 13 | 10:33:27 AM | 6 | -1.307143 | 36.825009 | 13 | 10:13:34 AM | 6 | 3 | NaN | Rider_Id_155 | 26.0 | 770 | User_Id_1796 | Bike |
| 21198 | 7 | 6:40:05 PM | 4 | 7 | 5:30:17 PM | 4 | 7 | 5:07:09 PM | 4 | -1.258414 | 36.804800 | 20 | Order_No_2831 | Business | 7 | 5:50:52 PM | 4 | -1.286018 | 36.897534 | 7 | 5:06:16 PM | 4 | 3 | NaN | Rider_Id_697 | 29.2 | 2953 | User_Id_2956 | Bike |
| 21199 | 4 | 10:08:15 AM | 3 | 4 | 9:38:59 AM | 3 | 4 | 9:31:53 AM | 3 | -1.279209 | 36.794872 | 13 | Order_No_6174 | Personal | 4 | 9:45:15 AM | 3 | -1.250030 | 36.874167 | 4 | 9:31:39 AM | 3 | 1 | NaN | Rider_Id_347 | 15.0 | 1380 | User_Id_2524 | Bike |
| 21200 | 26 | 3:17:23 PM | 2 | 26 | 2:24:29 PM | 2 | 26 | 2:20:01 PM | 2 | -1.320157 | 36.830887 | 12 | Order_No_9836 | Business | 26 | 2:41:55 PM | 2 | -1.255189 | 36.782203 | 26 | 2:19:47 PM | 2 | 3 | NaN | Rider_Id_177 | 30.9 | 2128 | User_Id_718 | Bike |